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Foreward 

We  describe  an  approach  for  accomplishing  the  high-level  mission  planning  required  for  a  heteroge¬ 
neous  team  of  autonomous  vehicles  performing  mine  countermeasure  (MCM)  survey  missions  in  multiple 
areas.  The  high-level  mission  scheduling  and  waterspace  management  requires  sequencing  the  order  and 
location  of  lower-level  MCM  tasks  to  be  completed  by  each  vehicle  in  the  heterogeneous  team:  unmanned 
surface  vessels  (USVs)  and  unmanned  underwater  vehicles  (UUVs).  We  propose  solving  this  complex 
sequencing  operation  by  leveraging  unique  information  processing,  communication,  refueling,  and  plan¬ 
ning  windows  that  form  constraints  within  the  system  within  a  formal  scheduling  optimization  framework 
known  as  mixed-integer  linear  programming.  We  pose  the  problem  using  a  mixed-integer  linear  program¬ 
ming  optimization  framework,  compare  several  complexity  reduction  heuristics  to  the  full  optimization, 
and  include  methods  to  account  for  relative  uncertainty  in  the  duration  of  planned  tasks  in  such  a  man¬ 
ner  to  balance  the  risk  of  schedule  slips  or  conflicts  to  the  risk  of  creating  an  overly  conservative  and 
sub-optimal  schedule. 


This  technical  report  has  been  prepared,  reviewed,  and  approved  by  the  Science  and  Technology  Depart¬ 
ment. 


BAYS.MATTHEW 
J.l  281 374881 


Digitally  signed  by 
BAYS.MATTHEW  J.l  281 374881 
DN:  c=US,  o*U.S.  Government, 
oq=DoD,  ou=PKI,  ou=USN, 
cn=8AYS.MATTHEWJ.l 281 374881 
Date:  2016.02.22  10:27:17  -06’00' 


Dr.  Matthew  J.  Bays,  Code  X22, 
Automation  and  Dynamics  Branch 


WETTERGREN.THO 


Digitally  signed  by 

WETTERGREN.THOMASA.l  230451849 
DN:c»U$,  0“U.S.  Government  ou-DoD, 

MAS.A.1 230451 849  En»WETTERC>REN.THOMASA.l  230451 S49 

. .  .  .Dm.-2016.a2.24J0*2:SS-O5W 


Dr.  Thomas  A.  Wettergren,  Code  00T1, 
Naval  Undersea  Warfare  Center,  Newport 


RODRIGUEZ.RAFA  R008WVKJWF,  A EUR  1232170472 

/  '  ON  t»lM,o«US  Government  oueDoO.ouvFffl, 

FI  R  1  1  ~7C\A~T^  oe-tWttn-*00«GUEiMFMlM2J2170472 

I  I  /  V7*T/  Z  Date  201*022  5  0H334-O6W 

Rafael  Rodriguez,  Code  X22, 

Automation  and  Dynamics  Branch 

Daniel  Kucik,  Code  X20, 

Unmanned  Systems  and  Threat  Analysis 


Dr.  Kerry  Commander,  Code  X, 
Science  and  Technology  Department 


2 


NSWC  PCD  TR  2015-003  Optimized  Waterspace  Mgt 


Contents 


1  Introduction  4 

1.1  Related  Work .  4 

2  Mine  Countermeasure  Logistics  Scheduling  using  MILP  5 

3  Constraint  Formulation  7 

3.1  Docking  and  Deployment  Constraints  .  7 

3.2  Fuel  Limitation  and  Charging  Constraints .  8 

3.3  Goal  and  Capability  Constraints .  8 

3.4  Time  Dependency  and  Wait  Constraints .  8 

3.5  Transit  and  Transport  Constraints .  10 

3.6  Survey  Constraints .  10 

3.7  Cost  Function .  11 

3.8  Reacquire  and  Identify  Scheduling  Constraints  .  11 

3.9  Neutralization  Constraints .  12 

4  Complexity  and  Complexity  Reduction  13 

4.1  Node  Reduction  Strategy .  15 

4.2  Edge  Reduction  Strategy  .  15 

5  Robust  Scheduling  Extensions  15 

6  Simulation  Results  19 

6.1  End-to-End  MCM  Mission  Scheduling .  20 

6.2  Deterministic  and  Robust  Scheduling  Results .  20 

6.3  Analysis  of  Complexity  Reduction  Heuristics  and  Computational  Tractability .  22 

7  Conclusions  24 

A  Distribution  27 


List  of  Figures 


1  Notional  illustration  of  an  MCM  mission  utilizing  unmanned  vehicles,  including  USVs  to 

transport  UUVs .  6 

2  Comparison  between  full  edge  transit  graph  and  graph  from  complexity  reduction  strategies.  16 

3  Risk  function  for  Schedule  slip  based  on  As(0,20)  with  cme ^  and  cshp  =  200.  .  .  18 

4  Plot  of  vehicle  actions  within  an  MCM  area .  20 

5  Simulated  MCM  Schedule .  21 

6  Comparison  between  deterministic  optimization,  a  naive  robust  optimization  approach,  and 

the  proposed  minimum-risk  scheduling  approach .  22 

7  Histogram  of  Monte  Carlo  simulations  showing  minimum  objective  value  found  by  different 

complexity  reduction  strategies .  23 


List  of  Tables 

1  Characterization  of  phase,  agent  count,  and  optimization  type  effects  on  optimization  quality.  24 


3 


NSWC  PCD  TR  2015-003  Optimized  Waterspace  Mgt 


1  Introduction 

The  use  of  autonomous  systems  to  perform  increasingly  complex  and  coordinated  tasks  has  necessitated  cre¬ 
ating  heterogeneous  teams  of  agents,  where  different  agent  types  specialize  in  different  parts  of  an  operation 
[1,  2,  3].  One  such  heterogeneous  team  operation  is  the  heterogenous  unmanned  system  mine  countermea¬ 
sure  (MCM)  scenario,  when  one  type  of  mobile  agent,  typically  unmanned  underwater  vehicles  (UUVs)  is 
tasked  with  performing  the  direct  surveys  and  re-acquire/identify  tasks  for  the  operation,  while  another 
type  of  larger,  faster-moving,  or  longer-range  agent,  an  unmanned  surface  vessel  (USV)  is  responsible  for 
transporting  the  UUVs  between  jobs  for  faster  completion.  Within  manned  systems,  there  are  numerous 
examples  of  the  USV/UUV  concept  such  as  aircraft  carriers  and  their  respective  aircraft,  garbage  trucks 
and  accompanying  garbage  workers,  or  mail  delivery  and  their  respective  postmen.  While  this  form  of  close 
interaction  between  unmanned  systems  is  still  far  from  common,  the  underlying  hardware  and  guidance  in¬ 
frastructure  to  allow  autonomous  docking  and  deployment  between  unmanned  systems  is  actively  researched 
[4,  5,  6,  7,  8], 

We  present  a  high-level  planner  that  takes  the  lower-level  vehicle  tasks  required  to  complete  a  typical 
MCM  survey  mission  and  sequences  them  in  a  near-optimal  manner  to  minimize  MCM  objective  functions 
such  as  mission  time  while  accomplishing  MCM  mission  objectives.  We  use  mixed-integer  linear  programming 
(MILP)  as  an  optimization  technique  for  our  approach  [9].  MILP  has  previously  been  used  for  probabilistic 
vehicle  motion  control,  and  is  capable  of  quickly  finding  near-optimal  solutions  to  constrained  optimization 
problems  involving  linear  and  integer  variables  [10]. 

The  principal  theory  developed  in  this  research  will  be  how  to  pose  the  MCM  scheduling  problem  in  a 
formal  mathematical  manner  such  that  it  can  be  input  into  an  off-the-shelf  MILP  solver,  while  allowing  for  the 
inherent  uncertainty  in  the  system.  MILP  requires  two  fundamental  components:  a  matrix  of  mathematical 
constraint  equations  that  capture  both  the  independent  variables  of  optimization,  as  well  as  linear  and 
binary  variables  that  are  dependent  on  those  optimization  variables.  The  second  component  is  the  linear 
cost  function  that  is  dependent  on  all  independent  and  dependent  variables  used  in  the  constraint  equations. 
If  these  components  are  posed  successfully  to  capture  the  dynamics  of  the  MCM  mission,  solutions  can 
be  found  relatively  quickly.  While  the  research  will  predominantly  consider  heterogeneous  teams  unmanned 
vehicles  for  the  development  of  the  optimization  constraints,  it  is  likely  this  research  could  be  applied  to  teams 
of  manned  systems  provided  similar  constraints  on  communication,  refueling,  or  other  mission  characteristics 
exist. 

1.1  Related  Work 

The  problem  of  scheduling  and  task  allocation  has  been  extensively  studied  with  a  variety  of  techniques  in 
multiple  different  forms.  Koes  et  al  developed  a  coordination  architecture  for  modeling  multirobot  coordi¬ 
nation  and  task  allocation  in  [11].  Our  framework  complements  this  methodology  by  expanding  the  types 
of  scenarios  under  which  a  constraint  optimization  framework  can  be  used:  namely  to  problems  where  there 
are  strongly  coupled  constraints  on  vehicles  such  as  transportation  actions. 

Korsah  et  al  create  a  framework  to  explore  optimal  vehicle  routing  with  cross-schedule  constraints  with 
applications  to  robotic  assistance  in  [12].  However,  there  are  several  notable  differences  to  our  work.  Firstly, 
the  principal  element  of  optimization  is  an  agent  route ,  not  the  collection  of  agent  actions  comprising  the 
route.  Additionally  in  Korsah’s  problem,  the  individuals  being  transported  along  the  routes  are  static; 
there  is  no  mechanism  within  the  optimization  framework  for  the  individuals  themselves  to  move  about  the 
area.  As  such  [12]  is  similar  to  the  traditional  vehicle  routing  problem  (VRP)  [13].  Similarly,  Mathew  et 
al  addresses  finding  the  shortest  route  for  a  unmanned  ground  vehicle  (UGV)  and  unmanned  aerial  vehicle 
(UAV)  pair  to  transit  and  make  deliveries  to  locations  only  reachable  by  the  UAV  in  [14],  However,  their 
work  focuses  primarily  on  the  overall  path  planning  for  the  two  vehicles,  and  not  on  fuel  limitations  and 
the  higher- level  scheduling  aspects  required  when  there  are  an  arbitrary  number  of  UAVs  and  UGVs.  Our 
work  takes  a  scheduling-centric  approach,  formally  incorporates  fuel  constraints,  and  is  generalized  for  an 
arbitrary  number  of  UUVs  and  USVs. 

Gombolay  et  al  develop  a  centralized  algorithm  to  efficiently  schedule  manufacturing  processes  using 
robotic  teams  in  [15].  However,  they  primarily  focus  on  the  development  of  a  task  sequencing  heuristic 

DISTRIBUTION  STATEMENT  A.  Approved  for  public  release;  distribution  is  unlimited. 
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that  allows  the  generalized  problem,  a  Simple  Temporal  Problem  [16],  coupled  with  simple  spacio-temporal 
constraints,  to  be  tractable  for  large  numbers  of  tasks  and  robots  to  be  assigned  to  a  fixed  set  of  tasks. 
Our  principal  contribution  is  in  the  unique  modeling  and  constraint-based  logic  required  for  the  docking, 
transport,  and  deployment  of  unmanned  vehicles  throughout  the  environment.  The  development  of  the 
non-trivial  constraints  within  our  framework  allows  for  applications  where  a  list  of  required  tasks  is  not 
given,  but  instead  a  variety  of  different  methods  are  available  to  the  vehicles  to  collectively  complete  a  set 
of  higher-level  objectives  by  the  vehicles’  interaction. 

Previous  work  specifically  for  MCM  scheduling  has  typically  focused  on  the  task  allocation  problem 
[17,  1].  For  example,  several  efforts  have  been  undertaken  to  determine  optimal  partitioning  of  search  areas 
for  undersea  search  with  various  assumptions  on  sensor  modeling  and  false  alarm  information  in  [18,  19].  Our 
work  extends  these  results  by  focusing  on  the  when  and  how  to  optimally  have  vehicles  prosecute  different 
mission  areas,  as  opposed  to  simply  where  vehicles  should  be  tasked.  More  recently,  work  by  Molieneaux  et  al 
have  developed  algorithms  to  aid  in  schedule  repair  for  previously-developed  MCM  operation  schedules  using 
a  cognitive  architecture  framework  [20] .  With  their  framework,  the  authors  attempt  to  repair  user-designed 
MCM  schedules  that  have  become  infeasible  due  to  unforeseen  circumstances  such  as  asset  breakdown  or 
tasking  delays.  In  our  work,  we  attempt  to  solve  the  initial  step  by  providing  a  formal,  automated  method 
to  plan  an  overall  MCM  schedule  from  a  mathematical  model  of  generalized  MCM  operation  constraints. 

General  robust  scheduling  techniques  have  been  studied  using  a  variety  of  methods.  Lin,  Janak,  and 
Floudas  develop  complimentary  approaches  to  formally  incorporate  uncertainty  in  scheduling  problems  in 
[21]  and  [22].  In  their  work,  Lin  et  al  develop  formal  methodologies  for  converting  MILP  scheduling  problems 
where  model  parameters  are  uncertain  under  either  a  bound  or  known  probability  distribution  into  another 
MILP  or  MINLP-type  problem  where  the  uncertainty  is  explicitly  incorporated  into  the  optimization  frame¬ 
work,  yielding  a  robust  solution  that  still  meets  the  problem  constraints  with  a  fixed  probability.  In  a 
similar  manner,  Bertsimas  and  Sim  create  a  method  to  adjust  the  conservatism  of  robust  MILP  solutions 
in  [23]  while  keeping  the  robust  problem  linear.  Other  work  on  robust  scheduling  and  integer  programming 
problems  in  the  general  sense  may  be  found  in  [24,  25,  26].  However,  to  our  knowledge  there  is  no  work 
explicitly  incorporating  Bayes  risk  to  schedule  slips  nor  for  the  application  of  MCM  scheduling.  Our  work 
has  the  similar  goal  for  the  specific  problem  of  MCM  scheduling,  but  instead  of  robustly  optimizing  with  a 
fixed  uncertainty  of  meeting  time  constraints  or  an  acceptable  number  of  constraints  that  may  be  violated, 
we  incorporate  a  cost  function  derived  from  Bayes  risk  using  an  adjustable  cost  of  a  schedule  slip  and  the 
potential  increase  in  buffer  time  for  a  task. 

This  paper  is  outlined  as  follows:  in  Section  2,  we  discuss  the  general  setup  of  the  the  MCM  scheduling 
problem,  including  the  definition  of  the  theoretical  base  problem  we  call  the  service  agent  transport  problem 
(SATP).  In  Section  3,  we  outline  the  mathematical  constraints  that  make  up  the  framework.  We  propose 
several  heuristics  in  order  to  limit  the  computational  complexity  in  4.  We  discuss  robust  scheduling  extensions 
that  utilize  a  cost  function  based  on  Bayes  risk  which  adds  buffer  times  in  between  tasks  into  the  system  in 
Section  5.  Finally,  simulation  results  showing  the  performance  of  various  implementations  of  the  SATP  are 
discussed  in  Section  6. 


2  Mine  Countermeasure  Logistics  Scheduling  using  MILP 

We  will  now  outline  an  MCM  scheduling  problem  by  developing  a  formal  mathematical  framework  with 
which  to  ultimately  optimize  a  vehicle  schedule.  Figure  1  shows  a  notional  illustration  of  the  survey  UUV 
transport  problem.  Let  there  exist  a  set  of  autonomous  vehicles  A  =  {1,...,A}  that  are  tasked  with 
surveying  a  number  of  survey  areas  S  =  {1, . . . ,  S}.  The  survey  areas  are  connected  by  a  directed  graph 
(V,  V).  The  nodes  T>  =  {o,  1, . . . ,  D}  represent  locations  at  which  an  agent  a  &  A  may  enter  a  survey  area 
s  £  S  in  order  to  perform  operations.  The  node  labeled  o  is  the  origin  node,  which  represents  the  starting 
point  for  all  valid  paths  in  the  graph.  The  agents  may  move  from  node  i  to  node  j  along  an  edge  of  the 
graph  v  =  ( i,j )  £  V.  The  subset  Ds  C  T>  represents  nodes  from  which  agents  may  survey  area  s. 

The  use  of  graph  representations  of  scheduling  problems  has  been  historically  used  effectively  due  to  the 
ability  of  the  methods  to  directly  lead  to  effective  heuristic  decomposition  procedures  [27]  that  can  greatly 
reduce  the  computational  complexity  of  the  optimization.  Structural  simplifications  of  the  graph  structure 
may  be  used  to  create  heuristics  on  the  survey  scheduling;  for  example,  the  resulting  optimized  schedule  is 
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expected  to  be  from  the  subset  of  acyclic  paths  within  the  graph  structure.  Thus,  reducing  the  problem 
size  by  decomposing  the  graph  structure  into  a  directed  acyclic  graph  structure  is  a  reasonable  complexity 
reduction  strategy;  such  an  approach  is  considered  in  Section  4. 
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Figure  1:  Notional  illustration  of  an  MCM  mission  utilizing  unmanned  vehicles,  including  USVs  to  transport 
UUVs. 

The  vehicles  are  separated  into  three  mutually  exclusive  and  collectively  exhaustive  subsets  Aur,  Ar,  and 
As  consisting  of  survey  UUVs,  reacquire  and  identify  (RI)  UUVs,  and  USVs,  respectively.  A  survey  UUV 
to  £  Aur  is  capable  of  directly  surveying  the  survey  areas.  An  RI  UUV  r  £  Ar  perform  RI  actions  on  areas 
previously  surveyed  and  in  which  MILCOS  have  been  found.  A  USV  n  £  As  may  collect  a  fixed  number 
of  the  UUVs  and  transport  them  between  nodes.  For  some  of  the  below  constraints,  we  also  wish  to  define 
the  union  of  survey  UUVs  and  RI  UUVs  as  Aur  =  Au  U  Ar ■  The  UUVs  may  move  between  nodes  with 
a  time  cost  c™jove ,  however  are  significantly  slower  than  USVs  with  transportation  cost  of  c-™ns  <  c™jove. 
The  USVs  also  require  a  fixed  amount  of  time  to  collect  and  deploy  the  survey  UUVs.  The  collection  and 
deployment  costs  are  denoted  cdock  and  cdeploy ,  respectively.  The  vehicles  initially  start  at  an  origin  node  o, 
with  each  USV  n  containing  An  UUVs. 

We  assume  it  is  known  due  to  enemy  course  of  action  (ECOA)  intelligence,  environmental  information, 
and  other  sources  that  there  is  an  expected  number  of  mine-like  contacts  (MILCOs)  N^IILCO  before  com¬ 
pletion  of  survey  actions  within  area  s,  as  well  as  an  expected  number  of  mines  N™mes  before  completion 
of  RI  actions  in  area  s.  Upon  completion  of  the  survey  and  RI  actions,  these  quantities  and  their  expected 
locations  will  be  known,  and  denoted  as  j\Wf/ico  and  jV™"65,  respectively.  Until  the  locations  are  known 
from  the  actual  survey  and  RI  actions,  we  assume  the  location  of  the  MILCOs  and  mines  follow  density 
functions  pmilco{s)  and  pmine{s).  Using  the  expected  number  of  MILCOs  and  mines,  we  assume  that 
expected  time  costs  for  performing  RI  and  neutralize  actions  can  be  estimated  for  each  survey  area  s.  We 
denote  the  expected  RI  and  neutralization  costs  as  c^rs  and  using  vehicle  r  or  n  on  survey  area  s. 

We  outline  the  general  logistical  constraints  for  an  operation  where  multiple  UUVs  may  be  transported 
throughout  the  held  via  an  USV  based  largely  on  [28].  We  then  expand  the  logistical  constraints  with  the 
specific  constraints  developed  for  general  survey,  reacquire  and  identify  (RI),  and  prosecution  tasking  such 
as  found  in  MCM  operations. 

The  primary  optimization  variables  for  our  specific  solution  to  the  SATP,  which  we  call  decision  variables, 
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fall  into  two  groups:  scheduling  indicator  variables  and  time  variables.  The  scheduling  indicator  variable 
I a,p,c  e  {0,  1}  denotes  that  task  type  k  is  scheduled  for  completion  by  vehicle  a  during  phase  p  at  location  set 
l  €  Ck,  where  Ck  is  the  set  of  all  feasible  locations  required  for  execution  of  task  type  k  and  {JkGlc£k  =  £■ 
The  timing  variables  £  R  and  £  R  denote  the  times  at  which  the  task  of  vehicle  a  during 

phase  p  is  scheduled  to  start  and  end,  respectively.  In  contrast  to  the  decision  variables,  additional  variables 
are  needed  for  the  optimization  problem  in  order  to  develop  the  necessary  constraints  to  formulate  the 
optimization  problem.  While  the  additional  variables,  which  we  denote  as  helper  variables  are  technically 
variables  used  in  the  optimization,  they  are  not  of  primary  concern  for  the  optimization  problem.  We  will 
discuss  the  helper  variables  as  the  details  of  the  optimization  problem  are  formulated. 


3  Constraint  Formulation 

We  now  turn  to  defining  the  constraints  required  for  the  mathematical  formulation  of  the  MCM  scheduling 
problem  pertaining  to  the  survey  constraints  and  logistics  management.  The  following  constraints  were 
originally  characterized  within  the  SATP  in  [28]. 


3.1  Docking  and  Deployment  Constraints 

The  first  constraint  within  the  SATP  is  that  a  UUV  must  only  complete  certain  tasks  when  they  are  docked 
or  deployed,  as  appropriate.  For  enforcing  this  constraint,  we  create  the  helper  variables  to  track  whether  a 
survey  UUV  m  is  docked  or  deployed  from  a  USV  n.  We  denote  this  variable 

Y=.  _  J 1  UUV  m  is  docked  with  USV  n  in  phase  p  ,  , 

mn’p  -  Otherwise 

Within  an  integer  programming  framework,  we  create  (1)  with  the  formal  constraint 

Vm  £  Aur,Vn  £  As,Vp  £  V  (2) 

j dock  _  j deploy 

Lmn,p',d  *-mn,p' ,d  ’ 

p'<p,d£D 


D 


mn,p 


=  E 


where  d  and  I mn,p\d’  are  indicator  variable  docking  and  deploying,  respectively,  at  particular  dock/deploy 

nodes  d.  We  bound  Dmrt)P  with  the  constraint 

Vm  €  Aur,\/n  £  As,\/p  £  V  0  <  Dm„,p  <  1.  (3) 

With  constraints  (2)  and  (3),  DTOn>p  equals  1  if  UUV  m  is  currently  docked  with  USV  n  during  phase  p, 
and  0  otherwise.  Constraint  (2)  and  the  lower  bound  in  (3)  satisfies  the  additional  requirement  that  in  order 
for  a  survey  UUV  to  be  deployed,  it  must  first  have  been  docked.  It  is  also  necessary  to  track  if  an  UUV  is 
docked  or  deployed  in  general.  This  is  performed  using  the  helper  variable  DmniP  and  constraints 


V?n  £  Aur,Vp  £  'P  Dm  p  —  'y  ]  Dmn  p  (4) 

n€As 

and 

Vto  €  Aur,  Vp  G  V  0  <  Dm;P  <  1  (5) 

(6) 


Additionally,  we  limit  the  total  number  of  UUVs  that  can  be  docked  at  a  given  time  by  a  USV  to  Dmax 
using  the  constraint 

Vn  £  As,Vp  £  V  ^  v  Dmn  p  Dmax  (7) 

m&A-u  r 

With  constraints  (4)-(7),  we  can  now  limit  the  feasible  actions  of  the  individual  vehicles  when  deployed  or 
docked  as  appropriate. 
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3.2  Fuel  Limitation  and  Charging  Constraints 

Fuel  consumption  is  an  additional  variable  that  must  be  tracked  for  each  vehicle  during  all  phases  of  the 
mission.  For  tracking  fuel,  we  define  the  variable  Fm  p  €  R  as  the  amount  of  fuel  survey  UUV  m  has  available 
during  the  start  of  phase  P.  The  variable  FTOjP  is  assigned  a  value  using  the  constraints 


Vm  e  Aur,Wp  e  V 


F  m,p  — 


p'<p 


charge 
'  m,p' 


seS 


survey -rsurvey 
m,s  1m,p/  ,s 


El  rdock-wdock  ,  ndeploy-r  deploy  \ 
l  Jm,d  Lm,p' ,d  '  Jm,d  Lm,p',d ) 


dev 


Exmove  y move 
J  m,p'  ,v*"m,p'  ,v 

vev 


(8) 


where  f^vey,  fmff,  fmPd°V >  an<l  fm°ue  are  fixed  costs  for  executing  their  respective  goals,  and  F^“rffe  is  a 
floating  variable  indicating  the  amount  a  vehicle  is  charged  during  a  phase.  The  variable  F^“r9e  is  fixed  to 
its  value  using  the  linear  survey  UUV  charge  rate  c^ar9e  and  the  constraints 


Vm  G  Aur,  Vp  GV 


-py  _  -|  _v  -recharge  ^  charge  (rpend 

Urn, p  —  1  r  m,p  —  ("rri  \  m,p 

t>m,P  =  o  =►  F^;9e  =  o. 


rpstart\ 
m,p  ) 


(9) 


Additionally,  we  limit  the  fuel  capacity  with  the  constraints 


Vm  G  Aur,  Vp  G  V 

F  <  F 

L  m,p  —  1  max 

(10) 

Vm  G  Aur,  Vp  G  V 

0  ^  F  p 

(11) 

3.3  Goal  and  Capability  Constraints 

In  our  formulation,  each  vehicle  can  only  prosecute  one  goal  during  a  phase  of  the  mission.  As  such,  we  have 
the  constraint 

Va  €  A,\/p  G  V  (12) 

keKL  leC'k 

As  not  every  vehicle  is  capable  of  completing  every  task,  this  requires  additional  constraints  to  force  only 
vehicles  capable  of  completing  a  task  to  be  scheduled  for  the  task.  As  such,  we  have  the  constraint 

VaiAk,VpeV,Vl€Ck  I*P)i  =  0,  (13) 

where  Ak  is  the  set  of  vehicles  that  are  capable  of  action  k.  We  note  that  this  constraint  may  also  be 
implemented  for  tractability  by  limiting  both  the  scheduling  variables  I*  l  and  constraints  specific  to  a 
certain  capability  that  are  created  in  the  optimization  problem  to  those  allowed  by  the  individual  vehicle’s 
capabilities. 

3.4  Time  Dependency  and  Wait  Constraints 

We  now  turn  to  the  constraints  required  for  determining  the  specific  times  at  which  phases  are  scheduled 
for  starting  and  ending.  Without  loss  of  generality,  we  assume  that  the  start  time  of  the  initial  phase  is  at 
t  =  0.  As  such  we  have  the  constraint 


Ma&A  TSaap=o  =  0. 


(14) 
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In  order  to  adhere  to  the  boundary  conditions  between  the  end  of  the  last  phase  and  the  start  of  the  next 
phase,  we  have  the  constraint 


Va  G  A,  Mp  G  V  |  p  ±  0  T",  >  (15) 

With  constraints  (14)  and  (15),  we  have  defined  all  variables  with  the  exception  of  the  end  time  of  each 
phase,  .  The  end  time  may  be  defined  by  taking  advantage  of  (12)  with  only  one  task  being  scheduled 
for  a  vehicle  during  a  phase 

Va  G  A,  Mp  G  V  T™d  =  Ttr  +  E  E  (16) 

kefC  leC*k 

where  L  is  the  duration  of  task  k  if  performed  by  vehicle  a  on  location  set  L.  Additionally,  the  dual 
relationship  between  a  UUV  and  USV  performing  docking  and  deployment  constraints  necessitates  that  we 
must  ensure  that  those  particular  operations  occur  at  the  same  time  for  both  vehicles.  Thus,  we  have  the 
time  dependency  constraints 


Mm  G  Aur,Mm  €  As,Mp  G  V 


Ejdock  _  -|  restart  _  restart 

*-mn,p,d  =^>  *-m,p  *~n,P 

dev 

Mm  G  Aur,Mm  G  As,Mp  G  V 


Ejdeploy  _  rpend  _  rpend 

*-mn,p,d  1  ^  *-m,p 

dev 


(17) 


(18) 


Some  scheduling  situations  necessitate  an  vehicle  loitering  or  otherwise  performing  no  meaningful  actions 
during  a  given  phase  so  that  it  may  properly  coordinate  with  the  actions  of  other  vehicles.  For  example,  a 
USV  may  need  to  wait  at  a  docking  point  during  a  phase  in  which  a  survey  UUV  is  servicing  an  area  with 
which  it  will  dock  in  the  proceeding  phase.  In  this  case,  we  require  an  additional  variable  indicating  that  a 
vehicle  is  idle  during  that  phase  at  a  given  location.  We  define  the  1“’“''*,  indicator  variable  by  the  constraints 


Ma  G  A,Mp  G  V,Md  G  V 


E 


keic  Lie£k 


V  T' 

/  ;  a,p,l 


E 

d!  G  V\d'  ^  d 


j wait  _  1  _ v  t wait  _  r\ 

La,p,d'  ~  1  La,p,d  ~  u 


Ma  G  A,Mk  G  IC,Mp  G  V,Md  G  V 


E 

fce/c 


L  die 


Et k  I  ,  V"  T  wait 

La,p- 1,. 

I  M  £fc| 


i  \  r wait  _  I  ,  t wait  

“T  2_^  La,p-l,d'  —  1  ^  La,p,d  ~  u 


d!  G  V\ 
d!  ^  d 


Ma  G  A,Mp  G  V  E  E  1Ip,i  = 0  ^  E  W  =  !■ 

keKL  lee*]?,  dev 


(19) 


(20) 


(21) 


Constraint  (19)  forces  to  equal  zero  if  the  vehicle  is  otherwise  tasked  or  waiting  at  another  location. 

Constraint  (20)  forces  I to  equal  zero  if  the  vehicle  performed  an  action  at  a  different  location  d'  during 
the  previous  time  step,  thereby  prohibiting  the  vehicle  from  waiting  at  location  d.  Constraint  (21)  forces 
1“ pltd  to  one  if  the  vehicle  is  not  otherwise  tasked. 
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3.5  Transit  and  Transport  Constraints 

We  now  turn  to  the  constraints  required  for  transiting  vehicles  and  transporting  them.  In  order  for  a  vehicle 
to  transit  from  one  location  to  the  next,  it  must  have  performed  an  action  at  the  previous  location 


Vm  G  Aur,Mn  G  As, Mp  €  V,Md0  €  D 


Ej  move 

Am,p,(do,d) 

dev 


=  1 


+ 


Ej  move  I  \  ^  j  deploy 

*-m,p—l,(d',d0)  '  /  ;  Lmn,'p—l,dQ 


d'eV 

E 


neN 

j  survey  ,  jwait  _  -i 

*~m,p—l,s  '  *-m,p—  l,d  A  ' 


seS  |  doeVs 


(22) 


where  (do ,  d) ,  (d1 ,  do)  G  V.  Additionally,  we  have  the  constraint  that  for  a  vehicle  to  either  move  from  one 
area  to  the  next  or  to  survey  an  area,  the  vehicle  must  be  deployed 

Vm  G  Aur ,  Mp  G  V  Y.  =  1  =►  Dm,p  =  0  (23) 

v£V 


For  a  UUV  m  to  be  docked  with  USV  n,  both  vehicles  must  have  been  located  previously  at  the  location 
of  the  capture.  Thus,  we  have 


Vm  G  Aur,Mn  G  As,Mp  G  V,Md  G  V 


j  dock  _  -j  \  ^  j  move  ,  j  wait 

*-mn,p,d  1  ^  /  j  *-m,p—l,(do1,d)  ' 


+  E 


j survey 
Am,p-l,s 


seS  |  dev  s 


=  i 


j  dock  _  -j  \  ^  j  move 

*-mn,p,d  1  ^  /  j  *-n,p—l,(do1,d) 

do1  eD 


I  jwait 
'  *-n,p—  l,i 


+  E 

ml  G  Aur  I 
m!  ^  m. 


T  dock  I  y  deploy 

Lm'n,p—l,d  '  1m'n,p—  l,d 


>  1 


(24) 


(25) 


where  (dot,d),  ( do2,d )  G  V.  Likewise,  we  have  the  constraint  for  deploying  UUVs  as 

Vm  G  Vlur,Vn  G  As,Mp  G  P,Md  G  I? 

^deploy  1  .  -p.  _  -1 

lmn,p,d  —  r  ^  umn,p  —  J- 

rdeploy  _ 


jaeptoy  _  -i  ,  \  ^  jrno-ue 

Amn,p,d  1  ^  ^  Vn,p- l,(d0l  ,d) 


yiuait 
An,p—  l,d 


^Oi  G-D 

E 

m'  G  Aur  I 
m'  ^  m 


j  dock 

*-m'n,p—  l,d 


deploy 


I  f 


m'n,p—l,d 


>  l 


(26) 

(27) 


3.6  Survey  Constraints 

The  first  constraint  for  a  servicing  action  is  that  a  survey  area  can  only  be  surveyed  if  it  is  scheduled  after 
a  deployment  action  at  a  node  d  G  Ds  or  move  action.  Thus,  we  have  the  constraint 

Mm  G  Au,Mn  £  As,Mp  £V,Md  £V,Ms  £  S  (28) 

j  survey  _  -i  _ ,  \  ^  y  deploy  ,  \  A  jmove  _  i 

*-m,p,s  1  ^  ^  Amn,p-l,d  '  /  >  Am,p—  l,(d',d)  -*-* 

n£*4s  d' eD 
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where  (d! ,d)  G  V  Next,  we  have  the  constraint  that  a  survey  action  can  only  be  performed  if  it  is  scheduled 
while  the  UUV  is  deployed 


Vm  g  Aur  ,\/peV  J2  =  1  =►  =  0 


s£«S 


Additionally,  every  survey  goal  should  be  completed  exactly  once  by  the  group  of  vehicles.  Thus, 

Vse5  E  E1™^1- 

m£Au  P&V 


(29) 


(30) 


3.7  Cost  Function 

The  cost  function  to  be  minimized  in  our  initial  formulation  is  to  minimize  the  end  time  of  the  last  phase 
over  all  vehicles.  That  is,  our  objective  is 


minimize  maxT“p. 

a  ’ 


(31) 


3.8  Reacquire  and  Identify  Scheduling  Constraints 

An  example  of  such  mixed-integer  linear  program  constraints  is  shown  in  the  following.  Let  {p,p,,p,,| 
represent  indices  in  the  set  of  mission  phases  V.  Index  s  iterates  over  the  set  S  of  survey  areas,  and  d  G  V 
represents  the  set  of  dock/deploy  points  for  the  mission  where  UUVs  may  onboard  or  offboard  the  USV. 
Three  criteria  must  be  met  prior  to  the  RI  action: 

1.  the  area  s  must  have  been  surveyed  by  a  survey  UUV, 

2.  the  survey  UUV  m  from  survey  UUV  set  Au  must  have  been  docked  with  a  USV, 

3.  sufficient  time  must  have  elapsed  from  when  the  survey  UUV  had  docked  with  a  USV  to  perform 
post-mission  analysis  (PM A). 

The  first  two  criteria  may  be  captured  by  the  constraint 


Vr  G  Ar,Vp  G  P,Vs  G  S 


iRI  =i=*.  v 

r,p,s  7  /  j 


m^Au 


E 

p'<p 


Et dock  \  i 
*-m,p'  ,s  —  ^ 


neAs,deD 


a  T survey  > 

/x  /  j  ±m,p",s  —  1 

p"<p' 


>  l. 


(32) 


where  \RIp  s  is  the  action  indicator  variable  for  scheduling  RI  UUV  r  performing  an  RI  action  during  phase 
p  in  area  s.  In  order  to  capture  the  third  criteria,  we  create  the  constraint 


Vn  G  As,Vp  G  Vydn  G  Vn  (33) 

r RI  _  -|  rpstart  ^  rr\PM A, end 

*-n,p,dn  ^  =^>  "*■  n,p  —  -*-5  5 
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where  T^AgA’end  is  a  new  time  indicator  variable  indicating  the  time  PMA  has  ended  for  the  RI  component 
of  an  operation  on  survey  area  s,  which  is  defined  using  the  constraints 


\/n  g  -4s,Vp  eV,VseS 


E 

m£Au. 


E-rsurvey  ^  i 
^m^p'  ,s  — 

p'<p 


A  \ldock  >  ll 

1  '  L  m,p,s  —  XJ 


>  1 


rpPM  A, start  rpend 

*~n,p,s  *~niP 

Vn  G  -4s,Vp  G  V,\/s  G  S 


E 

MeAu 


PM  A, start 


Et survey 

^m^p'  ,s  — 

p'<p 


A  \ldock  >  ll 

’  x  L  m,p,s  —  J 


=  o 


T 

Vs  G  5 

rpPM A, start  min  rpPM A, start 

~  neA™™,Pev  n’p’s 

Vs  G  5 

ry\P  M  A,end  rpPM  A, start  ,  p,PMA 


where  Tblg  is  a  large  constant. 


(34) 


3.9  Neutralization  Constraints 

We  now  turn  to  determining  a  proper  schedule  for  the  neutralization  phase  of  a  mission.  In  the  scheduling 
of  neutralization  activities  of  a  given  survey  area  s,  three  criteria  must  be  met  prior  to  the  Neutralization 
action: 

1.  the  reacquire  and  identification  task  must  have  been  performed  on  area  s  by  an  RI  UUV  r  G  Ar, 

2.  the  RI  UUV  r  must  have  been  docked  with  a  USV, 

3.  sufficient  time  must  have  elapsed  from  when  the  survey  UUV  had  docked  with  a  USV  to  perform 
post-mission  analysis. 

The  first  two  criteria  may  be  captured  by  the  constraint 


v?i  g  As ,  Vp  g  vydn  G  Vn 


y  neut 
*~n,p,dn 


=  i-E 

reAr 


Et dock 

*-rn,p'  ,s 


p'<p 


.nGAs 


>  1 


A 


V  lRI"  >  1 

/  j  r,p  ,s  — 

p"<p' 


>  i, 


(35) 


where  I”e^  is  the  action  indicator  variable  for  scheduling  USV  n  to  perform  an  neutralization  action  during 
phase  p  at  the  neutralization  point  dn  G  T>n.  In  order  to  capture  the  third  criteria,  we  create  the  constraint 


VnGds,VpG?,Vd„  GD„  (36) 

j  neut  -i  rp  start  \  rpPM  A' , end 

*-n,p,dn  ^  =?>  *~n,P  —  s  ? 
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where  T.^A9r4  ,end  is  a  new  time  indicator  variable  indicating  the  time  PMA  has  ended  for  the  RI  component 
of  an  operation  on  survey  area  s ,  which  is  defined  using  the  constraints 


V?i  G  As,Vp  G  V,  Vs  G  S 


(37) 


E 

r£Ar 


VF,  >1 

/  y  r,p',s  — 

p'<p 


A  ridocfe  >  ii 

L  r*,p,s  —  ■LJ 


>  l  => 


rpPM A'  ,start  _  rpend 

V?r  G  As,Wp  G  V,\/s  G  S 


E 

r£Ar 


VF,  >i 

/  j  r,p',s  — 

p'<p 


A  ridocfe  >  ii 

L  r,p,s  —  J 


=  o  => 


PM  A'  .start 


—  r^big 


T 

Vs  G  5 

f-nPMA' , start  min  rpPM A' , start 

~  neA™™,Pev  n’p’s 

Vs  G  5 

rj^PMA/,end  rpPMA/, start  _|_  £ PMA' 


With  the  above  constraints,  we  can  incorporate  the  scheduling  of  neutralization  sorties  within  our  scheduling 
algorithm.  However,  in  order  to  incorporate  the  physical  constraints  of  allowing  a  USV  to  travel  to  a  location, 
and  requiring  the  transit  for  a  neutralization  action,  we  must  include  the  following  constraints.  First,  we 
require  that  in  order  for  a  USV  to  neutralize  a  target,  it  must  have  previously  traveled  to  that  target, 
represented  by  the  constraint 


Vn  G  As,Wp  G  V  s.t.p  ^  l,Vdn  G  T)  (38) 

j neut  -|  v  \  'v  jmoven  -i 

Ln,p,d„  ~  1  ^  .  An,p—  l,d' 

d'n£T>n  s.t.  <d^,d>GV,i 

Secondly,  in  order  to  allow  transits  between  dock/deploy  nodes  T>  and  neutralization  nodes  T>n,  we  require 
the  constraint 


Vn  G  As,\/p  G  V  s.t.p  ^  1  ,Vd„  G  V„ 


Y  d'n  e  vn  s.t.  <  d'n,  dn  >g 


move-n 
dn 


=  1  => 


E 


jmoven  _  1 

Ln,p-l,d'  ~ 


d'  FDn  s.t.  <d'  ,dn>€Vn 


(39) 


We  now  conclude  the  constraints  that  implement  neutralization  actions,  and  turn  to  the  overall  cost  function 
used  in  the  optimization  process. 


4  Complexity  and  Complexity  Reduction 

Now  that  we  have  fully  modeled  the  MCM  scheduling  problem,  we  now  discuss  initial  methods  to  limit  the 
search  space  required  by  MILP  solvers.  The  full  optimization  is  NP  hard,  which  can  be  shown  with  the 
following  proofs  based  on  the  formal  SATP  framework. 

Proposition  4.1.  The  SATP  belongs  to  the  class  NP. 

Proof.  We  first  pose  the  corresponding  decision  problem  to  the  optimization  problem  constructed  from 
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constraints  (2)-(30)  and  cost  function  (31)  as  a  formal  language. 

SATP  =  { (V,  V),  c,  M,  N,  l,  /(•),  P,  T'}  (40) 

/  :  £>  -X  S 

r'eK 

M  survey  UUVs  and  N  US  Vs  are 
assigned  tasks  during  phases  V  =  {1, . . . ,  P} 
such  that  constraints  (2)-(30)  hold  with 
maxT ®”p  of  at  most  T' . 

Next,  consider  a  two-input,  polynomial  time  algorithm  A(-)  that  can  verify  SATP.  One  of  the  inputs  to 
A(-)  is  an  implementation  of  constraints  (2)-(30).  The  other  input  is  a  binary  assignment  of  the  decision 
variables  I„  Dm>p,  Fm}P,  and  T(]”f .  We  construct  algorithm  A  as  follows:  for  each  instantiation  of 

the  constraints,  we  check  to  see  if  the  constraints  hold  in  polynomial  time.  For  implementation  of  constraints 
(9),  (17)- (30),  we  first  convert  them  to  a  standard  MILP  implementation  using  techniques  such  as  Big-M 
[29].  The  conversion  can  also  be  done  in  polynomial  time.  The  algorithm  then  finds  the  maximum  value  of 
T ®"p,  and  checks  to  see  if  it  is  at  most  T' .  This  process  certainly  can  be  done  in  polynomial  time.  □ 

Proposition  4.2.  The  SATP  is  NP-complete. 

Proof.  To  prove  the  SAT  is  NP-Complete,  we  must  show  that  it  1)  belongs  to  NP,  and  2)  is  polynomial-time 
reducible  to  another  problem  that  is  NP-Complete  [30].  Proposition  4.1  satisfies  the  first  criterion.  For 
the  second  criterion,  we  will  use  the  Traveling  Salesman  Problem  (TSP),  which  has  been  shown  to  be  NP- 
Complete  [30].  We  will  now  show  that  SATP  is  polynomial  time-reducible  to  TSP ,  written  SATP  <p  TSP. 
Let 


TSP  =  {(Vt,Vt),ct,T'}  (41) 

q  :  V  x  V  ->  K 
T'eR 

(Vt,  Vt)  has  a  tour  of  at  most  Tt. 

We  first  construct  an  algorithm  ft  :  TSP  — >  SATP  as  follows.  Since  we  make  no  restriction  on  node  d,  £  T>s 
being  exclusive  to  survey  area  s  €  S  within  SATP  we  assign  the  edges  of  T>t  to  map  one-to-one  to  individual 
survey  areas,  all  survey  areas  reachable  by  UUVs  and  transport  agents  by  all  edges,  and  one  of  the  survey 
areas  to  serve  as  the  origin  node  o.  Thus,  there  are  S  =  \T>t\  survey  areas  in  the  SATP  equivalent.  The 
standard  TSP  problem  assumes  a  route  for  only  one  agent.  We  therefore  restrict  all  mappings  of  TSP  to 
the  case  where  N  =  M  =  1  within  the  new  SATP  problem.  Likewise,  we  may  consider  the  ’phases’  within 
the  TSP  as  equal  to  twice  the  number  of  vertices  (one  each  for  transit  and  survey),  plus  two  for  docking 
and  deployment  of  the  single  survey  agent  by  the  single  USV.  Thus,  we  have  P  =  \2T>t  \  +  2.  For  clarity,  we 
will  remove  the  now  superfluous  subscripts  m  and  n.  Let  cdock  =  cdeploy  =  csurvey  =  0  in  the  new  SATP. 
Let  fssurvey  =  f$ock  =  fdeploy  =  f™ove  =  ccharge  =  0.  Let  c%ove  =  djtrans  for  all  <i,j  >=  v  G  V.  These 
assignments  can  be  done  in  polynomial  time. 

We  now  show  that  TSP  has  a  solution  of  at  most  T'  if  and  only  if  the  mapped  SATP  has  a  solution 
of  at  most  T'  as  well.  Suppose  TSP  has  a  solution  of  at  most  T' .  The  assignment  of  fuel  costs  to  zero 
causes  constraints  (8)-(10)  to  become  trivial,  and  the  combination  of  cdock  =  cdeploy  =  0  and  c™ot’e  =  cVans 
cause  there  to  be  no  difference  in  servicing  time  between  a  UUV  moving  or  being  transported  and  deployed 
between  survey  areas  via  (16).  The  assignment  P  =  2\Vt  \  +  2  cause  the  only  feasible  schedules  to  be  those 
involving  an  initial  deployment  of  the  single  UUV,  followed  by  a  TSP  tour  of  alternating  UUV  transits 
and  servicing  actions  followed  by  a  single  docking,  due  to  constraints  (2)-(7)  and  (30).  However,  the  equal 
transport/move  cost  and  zero  dock  and  deploy  cost  assignment  causes  (31)  to  degenerate  into  the  simple 
tour  cost  of  TSP.  Thus,  the  resulting  SATP  has  a  minimum  time  of  at  most  T'  as  well.  Next,  suppose  that 
SATP  has  a  completion  time  of  at  most  T' .  Then,  since  each  survey  node  must  have  been  visited  exactly 
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once  due  to  the  choice  of  P,  and  all  costs  are  zero  with  the  exception  of  transit  costs,  the  corresponding  TSP 
problem  has  a  tour  cost  of  at  most  T'  as  well.  □ 

We  now  present  two  strategies  for  increasing  the  tractability  of  the  SATP  and  thereby  MCM  scheduling 
problem  by  limiting  the  required  number  of  variables  needed  in  a  particular  SATP  implementation.  The 
first  strategy  reduces  the  number  of  candidate  dock,  deploy,  and  transit  nodes  to  the  closest  S  nodes  that 
provide  one  node  per  survey  area.  The  second  strategy  maintains  the  number  of  candidate  nodes,  but 
eliminates  all  transition  edges  between  nodes  within  the  same  survey  area.  Since  the  embedded  routing 
problem  significantly  increases  the  complexity  of  scheduling  problem  as  a  whole,  the  general  idea  of  both 
reduction  strategies  is  to  decrease  the  routing  problem  in  intelligent  ways. 

4.1  Node  Reduction  Strategy 

We  first  attempt  to  reduce  the  number  of  variables  by  selecting  only  a  subset  of  dock  and  deploy  nodes  for 
consideration  in  the  overall  optimization  framework,  specifically  the  S  nodes  in  a  cluster  that  minimize  the 
distance  traveled  to  visit  all  nodes  in  the  subset  while  providing  a  dock  and  deploy  point  to  every  survey 
area  S.  Formally,  we  can  write  this  node  subset  as 

V  =  ( argmin  V  c%ove  \i,j  G  V* ,  V  C  V,  Vs  G  5,  (42) 

^  v*  dex>* 

ra  =  i}* 

where  T>*  is  the  number  of  nodes  connected  to  survey  areas  within  the  node  subset  T>* .  Equation  (42) 
represents  a  form  of  the  generalized  traveling  salesman  problem  (GTSP)  [31,  32].  While  the  GTSP  is 
another  NP-hard  problem,  its  use  increases  the  tractability  of  the  SATP  in  two  ways.  First,  because  the 
GTSP  is  partly  encapsulated  within  the  SATP  when  determining  routes  for  the  individual  agents,  solving  the 
GTSP  before  the  main  schedule  optimization  will  eliminate  orders  of  magnitude  more  permutations  required 
in  the  overall  optimization  than  required  to  solve  the  GTSP.  Second,  there  are  several  heuristics  that  can 
quickly  provide  efficient  but  sub-optimal  solutions  to  the  GTSP  [31,  32,  33]. 

4.2  Edge  Reduction  Strategy 

A  second  candidate  strategy  for  reducing  the  scale  of  the  SATP  is  to  reduce  the  number  of  decision  variables 
by  reducing  the  total  number  of  candidate  edges  within  the  problem.  Like  the  node  reduction  strategy,  this 
is  a  balancing  act  between  reducing  the  scale  of  the  problem  and  eliminating  options  for  the  vehicles  to  travel 
in  order  to  transit  in  an  efficient  manner.  An  appropriate  compromise  between  the  two  conflicting  needs  is 
in  the  elimination  of  intra-survey  area  edges.  Specifically,  we  create  a  new  edge  map  V'  as 

v'  =  G  V| i  G  Vai,j  G  VSj,sx  £  s/\  .  (43) 

The  intuition  behind  using  (43)  is  the  assumption  that  there  is  little  need  for  either  a  survey  UUV  or  USV 
to  transit  between  two  nodes  in  a  single  survey  area,  as  once  an  area  is  surveyed,  the  vehicles  can  simply 
choose  the  best  route  to  the  next  survey  area. 

Transit  paths  between  four  survey  areas  consisting  of  four  dock  and  deploy  nodes  per  area  are  shown  in 
Figure  2.  The  full  graph  (V,  V)  is  shown  on  the  left-hand  side  of  the  figure.  The  graph  (T>,  V')  resulting  from 
edge  reduction  strategy  is  shown  in  the  center.  The  graph  (V ,  V)  depicting  the  node  reduction  strategy  is 
shown  on  the  right.  The  edge  reduction  strategy  eliminates  approximately  18%  of  the  edges  in  the  search 
space  for  agent  movement.  The  node  reduction  dramatically  decreases  the  number  of  edges  and  nodes, 
reducing  the  number  of  edges  to  8%  of  the  full  graph. 

5  Robust  Scheduling  Extensions 

We  now  extend  our  framework  to  incorporate  uncertainty  in  the  duration  of  various  tasks  being  performed  by 
the  individual  vehicles.  Our  methodology  is  based  on  the  general  idea  that  there  is  a  significant  potential  cost 
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8000 


Full  Edge  Map 


(a)  Full  Graph. 


Edge  Reduction  Strategy  Map 


East/West  [m] 

(b)  Graph  resulting  from  edge  reduc¬ 
tion  strategy. 


East/West  [m] 

(c)  Graph  resulting  from  node  reduc¬ 
tion  strategy  involving  GTSP  algo¬ 
rithm. 


Figure  2:  Comparison  between  full  edge  transit  graph  and  graph  from  complexity  reduction  strategies.  Blue 
represents  edge  in  transit  graph  V.  Black  represents  survey  area  boundaries.  Left:  Complete  graph  from 
all  dock/deploy  points.  Center:  Graph  from  edge  reduction  strategy.  Right:  Graph  from  node  reduction 
strategy  using  the  GTSP  algorithm. 


of  schedule  slips.  Specifically,  we  define  a  schedule  slip  as  when  the  end  time  of  a  task  as  scheduled  exceeds 
the  scheduled  start  time  of  the  following  task.  Often,  schedule  slips  can  have  catastrophic  consequences 
in  real-world  multi-agent  systems.  For  example,  an  UUV  finishing  a  survey  task  on  time  while  a  USV  is 
significantly  delayed  could  loiter  in  a  dangerous  location  or  potentially  run  out  of  fuel  before  docking  could 
be  performed.  Similarly,  a  survey  UUV  being  significantly  delayed  when  expected  to  rendezvous  with  a  USV 
could  significantly  delay  the  entire  schedule  for  the  remainder  of  the  mission.  In  order  to  develop  our  robust 
scheduling  scheme  to  mitigate  these  hazards  with  our  robust  survey  UUV  transport  problem  (R-SATP) 
extension,  we  first  make  the  assumption  that  for  tasks  taken  by  the  individual  vehicles,  no  tasks  can  be 
performed  before  their  scheduled  start  time.  Formally,  this  can  be  done  by  creating  a  constraint  defining 
the  true  initial  start  time  of  a  task  as 

T^^maxjT^T^},  (44) 


where  T^rlp_1  is  the  true  start  time  of  the  previous  phase  for  vehicle  a.  We  then  define  the  true  start  time 
of  the  previous  phase  as 


rr\end  _  restart  \  ^  \  ^  ~k  jk 

^  a,p—  1  a,p  '  /  j  /  j  (  a,p,l^a,p,l  ’ 


(45) 


fc£/C  1£j0  || 


where  c*  l  is  the  true  duration  of  task  k  during  phase  p  performed  by  vehicle  a  on  location  set  l.  We  now 
constrain  from  (15)  to 


restart  restart 

a,p-\- 1  a,p 


sr  sr  rk  r k  ,  Tbuff 

/  j  /  j  a,p  i 


k^KL  l£jCk 


(46) 


where  is  a  notional  buffer  time  to  prevent  schedule  slips  that  we  will  define  with  optimization  variables 

subsequently.  Beforehand,  we  will  characterize  our  general  strategy  of  focusing  on  minimizing  schedule  slips 
during  individual  phases.  We  do  so  with  the  following  proposition,  which  shows  that  if  a  phase  is  currently 
in  a  schedule  slip  situation,  there  must  have  been  a  phase  in  the  past  where  the  a  schedule  slip  did  not  occur. 


Proposition  5.1.  If 


and 


restart  rjhend 

J_a,p  J_a,p— 1 


rjr\ start  restart  .  \  ^  \  ^  ~  y k 

^a,p  ^  a,p—  1  '  /  j  /  j  ( a,P,l*-a,p,l 

keic  ieck 


(47) 

(48) 
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then  there  exists  a  p'  <  p/p  =  0  such  that 


rpstart  restart  .  \  \  ~  j k 

*~a,p'  ^-a,p'  —  1  '  /  j  /  j  ^CL,p,l^a,p,l 

fc£/C  ZG£fc 


(49) 


Proof.  Suppose  (47)  and  (48)  are  true  for  some  p.  Since 


‘  aiP 


-  a,p—  1 

_  rr\ start  _i_  \  ^  \  ^  ~  t k 

a,p—  1  '  /  j  /  j  ^a,p,l *-a,p,l  ? 


ZcG/C 


then 


rpstart  _j_  \  A  \  ^  ~  j k  ^  rpstart  ,  \  ^  \  '  ~  jk 
■*-a,p—  1  "t"  /  j  /  j  ( a, P, l *-a,p, l  ^  Aa,p-1  '  /  ^  /  j  ^a,p,ZAa,p,Z 

ZcG/C  ZG£fc  ZcG/C  Zg£& 


rpstart  ^  rpstart 
-*-a,p— 1  1* 


Due  to  the  definition  of  and  (52), 


restart  _  rpend 

a,p—  1  a,p—  2 

_  rpsiart  i  \  ^  \  ^  ~  jk 

~  1a,p-2  “>  /  ^  v  ca,p- 2,Z1a,p-2,Z- 
ZcG/C  l€Ck 

There  are  now  three  possibilities  to  the  relation  of  to 

restart  _i_  Y''  ~  t k 

Aa,p- 1  "T-  Z^fee/C  l^leCk  La>P~2’ll-a,p-2,l- 

~\star\ 

-  a,p—  1 


restart  ^  rpstart 


a,p— 2  ~r  y  ]  ^  '  Ca,p-2,lla,p-2,l 
fcG/C  Zg£& 


rristart  ^  rristart  I  \  ^  \  ^  ~  jk 

-La,p—  1  ^  J-a,p-2  ~ v  v  ca,p-2,lLa,p,l 

k£Kl  l£/Lk 

rjnstart  rpstart  .  \  \  ^  ~  j k 

-*-a,p— 1  -*-0,^—2  "r”  /  j  /  j  (  a,P~ 2,Z *-a,p— 2,Z  • 

ZcG/C  Zg£& 

Note  that  if  (58)  holds,  then  our  proposition  would  be  proven.  Suppose  (57)  holds.  Then 


restart  ^  restart 


1  a,p—  1  s  Aa,p- 2  ^  ^  ^  >  ^a,p—  2,zla,p— 2,Z 

ZcG/C  ZG-Z2fc 

-  a.p—2  T  ^  ^  Ca,p_2,;Ia,p_2,Z 

ZcG/C  l£tC-k  ZcG/C  l£jC-f~ 


n  start 


rpstart  ^  rpstart 
a,p—  2  ^  _*'a,p—  2’ 


which  violates  our  definition  of  T**“rt  in  (44).  So  we  must  assume  that 


1,P 

rpstart  rpsZart  .  \  A  \  ^  ~  j/c 

-*-a,p— 1  -*-a,p— 2  "T"  /  ^  ^  ("Ci,p—2,l^a,p—2,l’ 

fcG/C  l£j(Lk 


(50) 

(51) 


(52) 

(53) 

(54) 

(55) 


(56) 

(57) 

(58) 

(59) 

(60) 
(61) 

(62) 


We  now  have  the  same  assumptions  as  before  for  p  —  1.  By  induction,  we  can  follow  the  same  process  to 
p  =  1.  From  initial  conditions, 


rpstari  _  rpstart  _  rv 

a,0  —  -*-0,0  —  u 

T#rt=0+£  ]Tcaio,^)(V, 

fcG/C  l£Ck 


(63) 

(64) 
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satisfying  the  proposition  for  p  =  l.D 

We  use  Proposition  5.1  as  motivation  to  focus  on  treating  the  robustication  of  schedules  as  a  Markov 
process.  Namely,  that  the  dominant  factor  in  the  probability  of  schedule  slip  in  the  current  phase  is  the 
uncertainty  in  the  action  performed  in  the  previous  phase.  In  other  words,  we  may  mitigate  the  possibility 
of  a  schedule  slip  in  the  overall  schedule  by  reducing  the  possibility  of  a  schedule  slip  in  every  phase.  Using 
(44)-(46),  we  now  characterize  our  general  strategy  for  reducing  the  risk  of  schedule  slips. 

We  assume  there  is  a  numerical  cost  (measured  in  time)  to  the  occurrence  of  either  a  schedule  slip,  or  a 
delay  beyond  the  buffer  time 


E  E  <1^,1  >  Tbuff  => cSUp  (65) 

fc£/C  l£/lk 

E  E  <iika,P,l  <  TbUff  =►  cineff 

Using  the  definition  of  Bayes  risk,  see  [34] ,  we  may  write  the  Bayes  risk  for  this  cost  definition  as 

R  =  cineffP  (cka<l  <  Tkfuf f )  Pineff  +  cslipP  [cka  l  >  Tkfuf f )  Pslip.  (66) 

for  a  particular  action  k,  where  Plneff  and  PshP  are  prior  probabilities  of  adverse  consequences  of  either 
a  delay,  or  a  schedule  slip,  respectively.  Varying  the  buffer  time  and  assuming  the  task  duration  follows  a 
ck  t  oc  Ms(0,  20)  distribution  yields  a  risk  function  related  to  buffer  time  as  shown  in  Figure  3.  Note  that 


Service  Action  Schedule  Risk  vs.  Time 


Figure  3:  Risk  function  for  Schedule  slip  based  on  Ms(0,  20)  with  cme ^  =  Tkfu^  and  cshp  =  200. 


as  written,  the  risk  function  is  invariant  in  being  performed  during  a  particular  phase  p.  As  such,  we  may 
optimize  the  nonlinear  optimization  problem 


Tk.buff*  „  •  me 

a  i  =  argmm  c 


ffp{< 


a,l 


rpk,buff 
'  ±a,l 


^  pineff  _|_  cslipp  ^~k 


rpk,buff 
l  >  1a,l 


pslip 


(67) 
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using  numerical  techniques  prior  to  the  schedule  optimization.  Once  we  develop  an  optimal  buffer  time  for 
every  action,  location,  and  agent  (type),  we  treat  them  as  parameters  in  our  robust  scheduling  extension.  If 
pmef  f  an(]  pshp  are  treated  as  uniform  priors,  (67)  may  be  re-written  as 


rpk,buff* 

±a,l 


=  argmin  cineffP  [ckal 

rpk,buff  V  ’ 

1a,l 


<  T 


k,buff 


a, l 


)  +  cslipP  (< 


a, l 


>  T 


k,buff 


a, l 


(68) 


Once  the  optimal  buffer  times  for  each  individual  task  are  found  using  (67),  they  may  be  factored  into 
the  overall  optimization  process.  In  order  to  incorporate  the  buffer  time,  we  take  advantage  of  the  necessary 
delays  between  coordinated  agent  actions  in  the  MCM  scheduling  problem  by  using  the  wait  times  as  part 
of  the  required  buffer  times.  We  define  the  synchronization  time  for  agent  a  as  the  time  difference  between 
the  end  time  of  the  current  phase  and  the  proceeding  phase,  written 


rpsync  _  restart  _  rpend 

a,p  1a,p+ 1  a,p  ' 


(69) 


We  then  ensure  that  the  decision  variable  representing  the  scheduled  buffer  time  c  is  at  least  the  desired, 
optimal  buffer  time  with  the  constraint 

ffc  _  i  *  Fpsync  ^  rpk,buff  rebuff  rpk,buff  , =psync 

La,p,l  ~  1  A  x  a,p  —  a,l  ^  L  a,p  ~  1  a,l  ~  L  a,p  ■  \IU) 

If  the  synchronization  time  is  greater  than  the  optimal  buffer  time  parameter  Tkfu^,  then  the  buffer  time 
is  set  to  zero  with  the  constraint 


T k  —  1  a  rpsync  rpk,buff  .  rpbuf  f  _ 

*-a,p,l  L  /N  a,p  —  ±  a, l  ^  ^  o,,p 


(71) 


Finally,  the  robust  start  time  is  set  as  the  deterministic  start  time  plus  the  summation  of  all  buffer  times 
with  the  constraint 


VoG^VpeP  (72) 

Hp start, R  _  Fpstart.  ,  \  '  rp buff 

*~a,p+ 1  ia,p+l  '  /  j  a,p'  ' 

p'(z'Ps.t.p'<.p 


In  order  to  complete  our  robust  scheduling  extension,  we  modify  (16)  and  (17)  to  use  the  new  robust  start 
times,  creating  the  constraints 


Va  G  A,  Mp  G  V 

(73) 

rr\end  _  ry\start  ,  \  ^  \  k  j k 

a,p  a,p  '  /  j  /  j  (~'a,l*-a,p,l 

keK,  C,eC,k 

Mm  G  Aur,Mm  G  As,Mp  G  V 

(74) 

\  ^  T dock  _  -j  restart  restart 

/  j  *-mn,p,d  ^  m,p  n,p 

dev 

Mm  G  Aur,Mm  G  As,Mp  G  V 

(75) 

\  ^  j  deploy  -|  ^  rr\end  ry\end 

/  j  Lmn,p,d  1  ^  *-m,p  in,p  * 

dev 


6  Simulation  Results 

The  following  simulations  demonstrate  the  efficacy  and  computational  performance  of  the  proposed  schedul¬ 
ing  framework.  All  simulations  were  performed  using  the  IBM’s  ILOG  CPLEX  optimization  software  [35]. 
All  simulations  were  performed  on  a  2.8  GHz  quad-core  CPU  equipped  with  8  GB  RAM. 


19 


NSWC  PCD  TR  2015-003  Optimized  Waterspace  Mgt 


6.1  End-to-End  MCM  Mission  Scheduling 

We  first  demonstrate  an  end-to-end  MCM  mission  optimization  simulation,  where  the  scheduler  automati¬ 
cally  determines  a  near-optimal  schedule  for  performing  the  survey,  reacquire  and  identify,  and  neutralization 
components  of  the  MCM  mission.  Figures  4  and  5  illustrate  a  simulation  of  TV  =  1  USV,  M  =  2  survey 
UUVs,  R  =  2  RI  UUVs,  and  P  =  26  phases.  In  the  simulations,  we  assume  that  a  USV  has  a  speed  of  8 
m/s,  while  UUVs  have  a  speed  of  1.5  m/s.  We  assume  each  of  the  S'  =  4  survey  areas  require  csurvey  =  300 
minutes.  RI  and  Neutralize  tasks  require  cRI  =  150  and  cNeut  =  60  minutes,  respectively.  Furthermore, 
we  assume  that  each  docking  action  requires  cdock  =  60  minutes,  and  cdepl°v  =  30  minutes.  Post-mission 
analysis  time  for  the  survey  and  neutralize  phases  required  cfMA  =  240  and  CrjIA  =  15  minutes,  respec¬ 
tively.  Figure  4  represents  the  spatial  map  of  the  various  vehicles  moving  throughout  the  field.  Figure  5 
shows  a  temporal  schedule  of  the  overall  mission.  The  MILP  optimization  software  was  run  with  a  cutoff 
time  of  10  minutes  for  each  optimization  phase  (Survey-RI,  and  RI-Neutralize) .  For  the  purposes  of  this 
simulation,  we  assumed  that  all  targets  were  identified  as  mines,  and  therefore  no  additional  Neutralize-only 
phase  was  necessary.  As  seen  in  the  figure,  the  USV  travels  to  the  center-most  survey  area,  survey  area  2, 
to  deploy  the  survey  vehicles.  The  USV  then  travels  to  a  further  area  to  deploy  the  RI  vehicles.  The  USV 
then  commences  to  travel  about  the  area  docking  with  various  vehicles  as  the  vehicle  travel  under  their  own 
power  to  maximize  the  speed  at  which  post-mission  analysis  commences,  including  docking  with  a  vehicle 
to  immediately  deploy  it  again  in  order  to  survey  the  final  area.  It  is  noted  that  the  UUVs  largely  use  their 
own  motion  in  order  to  actually  travel  to  areas  to  prosecute.  We  suspect  this  is  due  to  sacrificing  speed 
on-station  for  maximizing  PMA  processing  speed. 
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Figure  4:  Plot  of  vehicle  actions  within  an  MCM  area. 


6.2  Deterministic  and  Robust  Scheduling  Results 

We  now  present  a  comparison  between  the  deterministic  scheduling  of  a  single  phase  of  an  MCM  mission, 
the  survey  phase,  the  proposed  robust  scheduling  approach,  and  a  naive  robust  scheduling  approach  serving 
as  a  baseline.  In  the  following  simulations,  we  assume  that  the  USV  has  a  speed  of  8  m/s,  while  UUVs  have 
a  speed  of  1.5  m/s.  We  assume  each  of  the  S'  =  4  survey  areas  requires  csurvey  =  100  minutes.  Furthermore, 
we  assume  that  each  docking  action  requires  cdock  =  20  minutes,  and  cdeploy  =  10  minutes.  For  the  task 
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Vehicle  Schedule 


Figure  5:  Simulated  MCM  Schedule.  Gantt  chart  represents  the  duration  of  each  action  performed  by  each 
vehicle  at  a  point  in  time.  Additionally,  PMA  processing  times  are  shown  at  the  top.  Blue  PMA  times  repre¬ 
sents  PMA  for  survey  components  of  mission.  Red  PMA  times  represent  RI  components.  Green  and  magenta 
represent  vehicle  deployment  and  docking,  respectively.  Blue  represents  vehicle  transits.  Yellow  represents 
surveys.  Black  on  RI  vehicles  represents  RI  actions.  Black  on  the  USV  row  represents  neutralization  actions. 


uncertainty  distributions,  we  used  cr^ock  =  10,  a ^epioy  =  1,  Purvey  =  30,  and  o^nove  =  for  nodes  i 

and  j,  respectively.  We  present  a  direct  comparison  of  simulations  between  the  deterministic  optimization 
problem,  a  naive  robust  optimization  problem,  and  the  minimum-risk  optimization  problem  discussed  in 
Section  5  for  N  =  1  USV  and  M  =  2  survey  agents  within  P  =  12  phases.  A  naive  method  for  mitigating 
concerns  of  schedule  slips  is  typically  to  add  a  fixed  amount  of  buffer  time,  often  as  a  function  of  the 
standard  deviations.  As  such,  to  compare  the  deterministic  method  to  our  robust  counterpart,  we  provide 
a  baseline  robust  simulation  where  three  standard  deviations  of  time  required  are  added  for  each  action 
type.  For  example,  a  survey  action  could  be  planned  for  csurvey  =  116.425  seconds.  For  the  minimum-risk 
optimization  problem,  we  set  c®*]p  =  500  and  clne^  =  for  all  task  types. 

Figure  6  shows  a  comparison  of  a  scenario  using  the  three  different  optimization  problems.  As  expected, 
the  deterministic  solution  has  the  best  completion  time  of  310.9  minutes.  The  naive  robust  solution  had 
a  completion  time  of  364.9  minutes.  The  minimum-risk  robust  scheduling  solution  had  a  completion  time 
of  345.45  minutes.  Also  of  note  is  the  different  strategies  taken  by  the  determinist  or  naive  solution,  and 
the  minimum-risk  solution.  The  deterministic  and  naive  robust  solutions  both  schedule  the  USV  (labeled 
transport  agent)  to  pick  up  and  then  re-deploy  one  of  the  UUVs  (labeled  survey  agents)  to  the  two  furthest 
survey  areas.  The  minimum-risk  scheduling  algorithm  has  the  USVs  both  deployed  at  the  closest  survey 
areas,  and  one  travels  to  each  of  the  furthest  areas  under  its  own  power.  We  conjecture  that  this  is  likely 
due  to  the  minimum-risk  scheduling  taking  into  account  the  increased  schedule  slip  risk  due  to  executing  an 
increased  number  of  tasks  required  for  dock  and  re-deployment  versus  a  single  move  action.  We  empirically 
analyze  the  computational  performance  of  the  minimum-risk  robust  solution  versus  the  deterministic  solution 
in  the  following  section. 
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(a)  Deterministic  solution  action  plot.  (b)  Naive  robust  solution  action  plot. 


(c)  Minimum-risk  robust  solution  ac¬ 
tion  plot. 


Agent  Schedule 


Transport  Agent  1 
Service  Agent  1 
Service  Agent  2 


m 


i 


i  i 


Transport  Agent  1 

Agent  Schedule 

n  n 

m 

A 

Transport  Agent  1  | 

lent  Schedule 

«l 

III 

Service  Agent  1 

ii  i 

1 

Service  Agent  1 

III 

1 

Service  Agent  2 

U _ IIL 

1 

Service  Agent  2 

_ ■ 

JI 

Deploy 

Service 


-200  -100 


100  200  300  400 


Time  [min] 

(d)  Deterministic  solution  schedule. 


-200  -100 


Time  [min] 

(e)  Naive  robust  solution  schedule. 


Dock 

Deploy 

Service 


100  200  300  400 


-200  -100 


100  200  300  400 


(f)  Minimum-risk  robust  solution 
schedule. 


Figure  6:  Comparison  between  deterministic  optimization,  a  naive  robust  optimization  approach,  and  the 
proposed  minimum-risk  scheduling  approach. 


6.3  Analysis  of  Complexity  Reduction  Heuristics  and  Computational  Tractabil- 
ity 

Figure  7  shows  results  from  a  Monte  Carlo  simulation  comparing  the  full  graph  to  the  two  complexity 
reduction  strategies  discussed  in  Section  4  for  an  SATP  optimization  problem  involving  N  =  1  USV,  M  =  3 
survey  UUVs,  and  P  =  15  phases.  Sixty  simulations  were  performed  with  three  survey  areas  randomly 
generated  over  a  10,000m  by  10,000m  field.  For  each  simulation,  the  cutoff  time  for  providing  the  best 
solution  from  CPLEX  for  the  optimization  problem  was  120  seconds.  As  seen  in  the  figure,  the  node 
reduction  strategy  resulted  in  the  fastest  sortie  time,  averaging  an  objective  value  (31)  of  281.4  minutes. 
The  edge-reduction  strategy  provided  a  modest  improvement  over  the  full  optimization  model,  averaging  an 
objective  value  of  373.1.  As  expected,  the  full  optimization  model,  due  to  its  significantly  greater  complexity, 
resulted  in  the  poorest  solutions,  averaging  398.8  minutes.  Additionally,  in  some  instances,  the  optimization 
solver  was  unable  to  find  a  solution  to  the  problem  in  the  specified  time  limit  due  to  the  computational 
complexity  of  the  full  optimization  problem.  The  timeout  without  a  solution  occurred  in  48%  of  the  full 
optimization  problem’s  simulations,  and  23%  of  simulations  using  the  edge  reduction  strategy.  However,  in 
no  instances  of  using  the  node  reduction  strategy  did  a  timeout  occur. 

A  second  Monte  Carlo  simulation  was  performed  involving  sixty  simulations  of  a  single  USV  and  survey 
UUV,  and  P  =  12  phases,  allowing  all  optimization  routines  to  run  until  a  globally  optimal  solution  is  found. 
In  this  manner,  the  Monte  Carlo  simulation  characterized  how  the  two  heuristic  reduction  strategies  (42) 
and  (43)  impact  the  discovery  of  the  globally  optimal  schedule  by  eliminating  transit  paths.  In  all  cases, 
the  globally  optimal  solution  to  the  full  optimization  problem  was  identical  to  the  globally  optimal  solution 
of  the  edge  reduction  strategy  (43).  The  node  reduction  strategy  (42)  did  eliminate  the  globally  optimal 
solution  in  all  instances,  however  this  elimination  increased  the  minimum  solution’s  objective  function  value 
by  an  average  of  only  3.45%,  and  maximum  of  7.7%. 

In  addition  to  the  simulations  characterizing  the  effects  of  the  various  heuristics  on  the  quality  of  the  opti- 
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Figure  7:  Histogram  of  Monte  Carlo  simulations  showing  minimum  objective  value  found  by  different  com¬ 
plexity  reduction  strategies. 


Histogram  of  Fastest  Schedule  for  Different  Complexity  Reduction  Strategies 


23 


NSWC  PCD  TR  2015-003  Optimized  Waterspace  Mgt 


Table  1:  Characterization  of  phase,  agent  count,  and  optimization  type  effects  on  optimization  quality. 
Times  represent  the  schedule’s  objective  value  in  minutes  after  5  minutes  optimization  runtime. 


Agent 

Count 

Opt. 

Type 

#  of  Phases 

12 

14 

16 

18 

20 

2 

Standard 

522.86* 

507.72* 

507.72* 

507.72* 

507.72* 

Robust 

567.61 

559.12 

567.61 

567.61 

567.61 

3 

Standard 

297.72 

295.16 

303.57 

303.57 

303.57 

Robust 

337.12 

338.01 

349.68 

360.39 

384.61 

4 

Standard 

244.14 

244.14 

244.14 

243.55 

244.14 

Robust 

377.62 

314.24 

333.70 

373.47 

390.96 

5 

Standard 

271.01 

271.01 

281.41 

296.43 

275.08 

Robust 

313.44 

513.45 

389.08 

409.80 

277.45 

6 

Standard 

173.02 

188.44 

236.97 

171.41 

192.42 

Robust 

424.41 

379.24 

275.37 

435.07 

mization  result,  we  wish  to  characterize  the  effect  of  increasing  number  of  (binary)  optimization  variables  on 
the  optimization  framework  while  using  the  node  reduction  heuristic.  Table  1  shows  a  number  of  simulations 
that  vary  the  agent  count  and  number  of  phases  in  which  to  schedule  tasks.  Additionally,  the  simulations 
show  the  difference  between  the  robust  versus  deterministic  SATP  case,  and  the  result  of  the  optimization 
yielding  the  best  solution  from  CPLEX  after  300  seconds  of  run-time.  If  computational  tractibility  were 
irrelevant,  the  best  solution  would  be  found  from  the  problem  formulation  involving  the  most  phases  and 
agents,  since  the  scheduler  could  take  advantage  of  tasking  multiple  agents,  and  slotting  those  agents  in 
complicated  task  schedules.  Asterisks  represent  where  the  globally  optimal  schedule  was  found  within  the 
cutoff  time.  For  A  >  5  agents,  two  USVs  were  used  in  the  scenario.  As  seen  in  the  table,  the  lower  number 
of  phases/agents  resulted  in  a  higher  likelihood  of  the  globally  optimal  schedule  being  found.  As  a  larger 
number  of  agents  and  phases  were  used,  the  results  were  generally  sub-optimal,  and  at  times  a  worse  result 
than  a  fewer  number  of  phases  for  the  same  agent  number,  due  to  the  dramatically  increased  search  space 
due  to  the  binary  variables.  From  the  table  and  judging  by  the  variation  in  best  search  schedules  returned 
before  the  cut-off  time,  reasonable  numbers  of  agents  for  the  SATP  scheduling  problem  as  developed  appears 
to  be  four  agents  or  fewer  for  the  deterministic  schedule,  and  three  agents  and  fewer  than  P  =  14  phases  for 
the  robust  scheduling  extension.  The  only  scenario  that  failed  to  produce  a  result  was  the  most  complicated 
of  six  agents  and  20  phases  using  the  robust  extension,  which  created  an  optimization  problem  of  26,190 
binary  variables.  This  compares  to  the  simplest  scenario,  which  created  an  optimization  problem  of  869 
binary  variables. 


7  Conclusions 

We  have  presented  a  framework  for  optimizing  scheduling  of  multiple  heterogeneous  vehicles  in  order  to 
perform  an  MCM  survey  mission.  The  framework  utilizes  mixed-integer  linear  programming  in  order  to 
obtain  optimal  to  near-optimal  solutions,  depending  on  the  complexity  of  the  problem  and  length  of  time 
the  algorithm  is  allowed  to  search  for  a  result.  Future  work  includes  extending  the  framework  to  re- acquire 
and  identify  tasks,  as  well  as  neutralization  tasks.  Additionally,  adding  probabilistic  constraints  in  order  to 
account  for  uncertainty  in  the  length  of  time  required  will  be  explored.  Lastly,  we  hope  to  explore  the  use 
of  pre-processing  heuristics  to  reduce  the  size  and  complexity  of  the  MILP  problem  and  lead  to  faster  solve 
times  for  efficient  solutions.  The  framework  utilizes  mixed-integer  linear  programming  techniques  to  create 
novel  constraints  required  to  develop  a  tasking  schedule  for  the  vehicles  including  docking,  deployment, 
and  movement  actions  performed  throughout  the  area.  A  robust  scheduling  technique  was  introduced  that 
extends  the  framework  to  include  probabilistic  constraints  in  order  to  account  for  uncertainty  in  the  length 
of  time  required  to  perform  tasks,  as  well  as  exploring  decentralized  optimization  strategies  to  increase  the 
scalability  of  the  optimization  framework.  Future  work  includes  developing  a  decentralized  optimization 
process  to  make  the  scheduling  algorithms  more  computationally  tractable  for  larger  numbers  of  vehicles. 
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